Aside

Download a PDF of this CV

Contact

Disclaimer

Last updated on 2020-09-18.

Main

Anderson Banihirwe

I contribute to and maintain several libraries within the open source scientific Python stack, particularly around improving scalability of Python tools in order to handle terabyte-scale datasets on HPC and cloud platforms.

Education

B.S., Computer Systems Engineering

University of Arkansas at Little Rock

Little Rock, AR

2018 - 2014

Professional Experience

Software Engineer

National Center for Atmospheric Research

Boulder, CO

current - 2018-10

  • Assisted with the development, implementation, and maintenance of software contributions to Open Source packages –namely dask, xarray, intake, jupyter — used by the Pangeo community as they pertain to NCAR systems or data.
  • Assisted with the development and maintenance of Pangeo-related installations and deployments on premise High Performance Computing (HPC) systems or commercial cloud that access NCAR resources.
  • Assisted with the development and deployment of live (virtual or in-person) and online/self-paced education material, including training websites, Jupyter notebook examples, and teaching modules.

Software Developer Intern

Quansight

Austin, TX

2018-09 - 2018-05

  • Developed xndframes, a Pandas ExtensionDtype/Array backed by xnd, a container type that maps most Python values relevant for scientific computing directly to typed memory.
  • Worked on integrating cuDF - GPU dataframe library with Apache Arrow library.
  • Worked closely with a customer to port existing Postgres code base to a Dask based workflow.

Data Science Intern

First Orion

Little Rock, AR

2018-04 - 2017-11

  • Built scoring, predictive models with Scikit-learn, Dask, and Apache Spark using First Orion’s proprietary telecommunication data.

Research Intern

National Center for Atmospheric Research

Boulder, CO

2017-08 - 2017-05

  • Developed spark-xarray, a Python package that integrates PySpark and xarray for climate data analysis.

Selected Publications, Posters, and Talks

The Pangeo Ecosystem: Interactive Computing Tools for the Geosciences: Benchmarking on HPC

2019 Supercomputing Conference Workshop on Interactive High-Performance Computing

N/A

2020

  • Authored with Tina Erica Odaka, Guillaume Eynard-Bontemps, Aurelien Ponte, Guillaume Maze, Kevin Paul, Jared Baker, Ryan Abernathey.

Zarr: chunked, compressed, multidimensional arrays

2020 Cloud Native Geospatial Outreach Day

Online

2020-09

Intake-ESM – Making It Easier To Consume Climate and Weather Data

2020 ESIP Summer Meeting

Online

2020-07

Intake / Pangeo Catalog: Making It Easier To Consume Earth’s Climate and Weather Data

2020 EarthCube Annual Meeting

Online

2020-06

  • Contributed Jupyter notebook about Pangeo’s data cataloging efforts.

Perceptual Judgments to Detect Computer Generated Forged Faces in Social Media

IAPR Workshop on Multimodal Pattern Recognition of Social Signals in Human-Computer Interaction

N/A

2019

  • Authored with Suzan Anwar, Mariofanna Milanova, Mardin Anwer.

Interactive Supercomputing with Dask and Jupyter

2019 Scientific Computing with Python conference

Austin, TX

2019-07

Beyond Matplotlib - Tutorial: Building Interactive Climate Data Visualizations with Bokeh and Friends

2018 UCAR Software Engineering Assembly conference

Boulder, CO

2018-04

PySpark for “Big” Atmospheric Data Analysis

Eighth Symposium on Advances in Modeling and Analysis Using Python

Austin, TX

2018-01